Quantitative cross-species comparison of GO annotations: advantages and limitations of semantic similarity measure

نویسندگان

  • Olivier Dameron
  • Charles Bettembourg
  • Léa Joret
چکیده

We highlight the various kinds of difficulties arising when comparing two sets of GO annotations using either the intersection of sets of annotations or semantic similarity measures. We illustrate our approach by comparing the GO annotations of Apolipoprotein A-5 (apoa5) and Apolipoprotein A-1 (apoa1) respectively between human (hsa) and mice (mmu). Apoa5 is involved in similar biological processes in the two species [1], whereas Apoa1 is known to be significantly different [2]. For each gene product, we retrieved the annotations for each species using the GOA database from the EBI. We also retrieved the evidence codes and modifiers of these annotations in order to take negation into account [3, 4]. The Gene Ontology provided the hierarchy between the annotations. We used the daily version.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluating GO-based Semantic Similarity Measures

Motivation: While several efforts have been made in measuring GO-based protein semantic similarity, it is still unclear which are the best approaches to measure it and furthermore whether electronic annotations should be used. Results: We studied the behaviour of 8 distinct semantic similarity measures as function of sequence similarity with and without electronic annotations. We found that 5 o...

متن کامل

Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses

BACKGROUND Semantic similarity analysis facilitates automated semantic explanations of biological and clinical data annotated by biomedical ontologies. Gene ontology (GO) has become one of the most important biomedical ontologies with a set of controlled vocabularies, providing rich semantic annotations for genes and molecular phenotypes for diseases. Current methods for measuring GO semantic s...

متن کامل

Gene Ontology-based Semantic Similarity Measures

Quantitative measure of functional similarity between gene products is important for post-genomics study. The similarity measures may be used to validate high-throughput protein interaction data, help the development of new pathway modelling tools and clustering methods, and enable the identification of functionally related gene products independent of homology [Guo et al., 2006, Schlicker et a...

متن کامل

A novel insight into Gene Ontology semantic similarity.

Existing methods for computing the semantic similarity between Gene Ontology (GO) terms are often based on external datasets and, therefore are not intrinsic to GO. Furthermore, they not only fail to handle identical annotations but also show a strong bias toward well-annotated proteins when being used for measuring similarity of proteins. Inspired by the concept of cellular differentiation and...

متن کامل

GOSemSim: an R package for measuring semantic similarity among GO terms and gene products

SUMMARY The semantic comparisons of Gene Ontology (GO) annotations provide quantitative ways to compute similarities between genes and gene groups, and have became important basis for many bioinformatics analysis approaches. GOSemSim is an R package for semantic similarity computation among GO terms, sets of GO terms, gene products and gene clusters. Four information content (IC)- and a graph-b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009